
Abstract 



Computationally inexpensive algorithm for detecting of dispersed transients has been developed using Cu- 
mulative Sums (CUSUM) scheme for detecting abrupt changes in statistical characteristics of the signal. 
The efficiency of the algorithm is demonstrated on pulsar PSR J0835-4510. 



Transient radio sky or time- variable radio sources in space have been recognized as one of the key science 
drivers for the Square Kilometre Array (SKA) 7] what will be reflected in its design and operations 4] . This 
science area is marked as Exploration of the Unknown what includes the likely discovery of new classes of 
objects and phenomena. Yet the statistical properties of transient radio sky remain largely unknown, what 
includes even the high-energy transients as they seem to be unfolding on very short time scales. 

In detection approach for the arrays such as MWA, LOFAR and SKA we propose to acknowledge the 
need for reducing data transport and computational cost while searching for fast transients. Therefore, we 
suggest that such detection needs to be done in stages with reduced data for storing and transporting at 
every consequent stage. Firstly, the presence of the abrupt change in raw time-domain data needs to be 
identified. Secondly, the evidences of the extraterrestrial nature of the detected signal need to be found. 
Thirdly, if such evidence is present, more detailed analysis can be applied. In this paper we present an 
attempt to develop a reliable and cost-effective technique for the first stage of detection using statistical 
signal detection approach. 



The proposed method of handling received time domain sequences is based on online abrupt changes 
detection scheme. The algorithm operates in time domain and is required to identify the onset of the signal 
by checking if the threshold of the detection process has been reached. 

Lets consider the signal with the following structure: 



Where t is time, n (t) £ A/'(/^o, ctq) is a Gaussian noise process with mean /io and variance ctq, s{t) e A^(/i, cr^) 
is a Gaussian signal with mean /i and variance a^. Let /ii — fig + ^ and — ctq + since we assume 
that signal and noise are independent. Hq and Hi are hypothesis under which the signal of interest is either 
absent (hypothesis Hq) or present (Hi) along with the additive background noise n(t). As we demonstrate 
further, the proposed utilization of algorithm known as Cumulative Sum (CUSUM) introduced by Page 
[ill] . Lorden [8|] allows for mean only parameter to be sufficient when building a detector. 



1 Introduction 



2 Detection approacli 



{ 



Ho ■■X{t)= n{t) 

Hi : X{t) = s{t)+n{t) 



(1) 



Changes detection scheme, used ino transient identification algorithm, is based on two generic detection 
procedures: log likelihood ratio (LLR) and CUSUM. 



LLR test is based on using a ratio of two pdf to build an indicator upon which a threshold can be applied. 
If the ratio exceeds given threshold, it indicates the prevalence of one of the hypothesis over another. 

LR for a Gauss-distributed data using the model given in ^ is: 

= —Tr — TT = — e ^ e ^"0 (2) 
We consider identification successful when £{x) > t], where 77 is a predefined threshold for identification. 
LLR is obtained by taking logarithm of ([2]): 



Hm) = m 4 m ^ + - (3) 



Equation ([3]) can now be expressed through either mean- or variance-based detector. 
In the latter case, assuming equal and zero mean, we obtain: 



+ (4) 



Assuming equal and unity variance, (|3]) becomes: L{x) — (/ii — no)ix — (/xq + /ii)/2). 
Thus, defining 

and rescaling the log likelihood expression by the proportionality value (/ii — /io) to obtain a threshold value. 
If the probability of Type I error a is fixed, the detection threshold can be expressed as 77 = y?r$(i-a) ^"A'l ■ 



CUSUM is a detection procedure proposed by Page Lorden [8[. CUSUM is a repeated LLR test 
for a change from one known distribution to another. We assume that for known densities /o and /i there 
exists an unknown change point v, where the input sequence X{n) G fo ii n < v and X(n) G /i if n >= v. 
The CUSUM statistics is the one that satisfies the following recursion: 

Wn = max{0, Wn^i + L„} (6) 

Where L is a likelihood ratio for /0//1 evaluated at X{n). The procedure raises an alarm at time t — 
argmin{|IV„| > \h\} for threshold h, which is discussed later. 

n 

If likelihood value from ([3]) used with regard to variances, the CUSUM recursion would take the following 
form, based on the usual radiometric output being the averaged sum of the samples' squares or the energy 
detector: 

Wn = max{0, W„_i + - r} (7) 

with the parameter r denoting variance-based coefficient r — p—^ — . Since it relies on the full knowledge 

of variances of the signal and noise, it is not always practical to use CUSUM algorithm with the recursion 
denoted in ([7]). 

Applying ([5]) CUSUM may be expressed as a procedure, which starts with VFq = 0, it recursively calcu- 
lates: 

Wn = max{0, Wn-i + X„ - m'} (8) 
and stops as soon as Wn exceeds threshold h. 



Generally speaking, equation ^ can be rewritten as W„ = maxjO, W„_i + X„ — r}, where r being a 
configurable parameter. Choice of this parameter is ruled by the expected behavior of the procedure, as 
stated by Page [Tlj : "Scoring is chosen so that the mean sample path on the chart when quality is satisfactory 
is downwards . . . and is upwards when quality is unsatisfactory" . As demonstrated by Chang f3\ the optimal 
value for parameter r will be the largest acceptable mean value or the smallest unacceptable mean value. 

One of the benefits of CUSUM scheme for signal detection is its stability in the presence of regression 
behavior in input data 0, , which is due to the gradual changes in the system temperature or changes of 
the overall sky temperature for large field of view or "all-sky" telescopes. 



3 Algorithm 

Choice between mean-based and variance-based CUSUM schemes depends on the type of the receiver 
used. For a linear output receiver, sampled according to Nyquist, the output represents voltage changes 
with time. Both mean- and variance-based indicator function can be used. However, mean-based indicator 
is preferred because it does not rely on variance-based separation of noise and signal. For full power square- 
law detector, however, mean-based indicator function becomes unusable due to the loss of information on 
mean. Therefore, variance-based CUSUM scheme should be used discounting the utilization of mean [H . 

CUSUM scheme requires a threshold value h to be chosen, which, when crossed, identifies the point of 
abrupt change in the statistical characteristics of the signal. Threshold value h can be derived from Wald 
test on the mean of a normal population Assuming that fii > /io, the value of threshold h is equivalent 
to a sequence of Wald sequential tests [ij with boundaries (0, h): 

• (9) 



o-Q (Mi - Mo) 



where a was interpreted by van Dobben de Bruyn [l^ as a crude approximation to the proportion of samples 
that trigger false alarm. This value can also be interpreted as a probability of false alarm in a traditional 

In a 
(/Ji-Mo) ■ 



sense. Since we assumed that ai — ao ^ can be rewritten as h — " 



The algorithm was tested on the data obtained from Parks radiotelescope. Input data contained 1 second 
observation from the Vela pulsar PSR J0835-4510. The observation was sampled at 1416 MHz with 64 MHz 
bandwidth. PSR J0835-4510 has a period of 89 ms. Figure [1] represents a portion of the observed data with 
the detected pulses marked by vertical red lines, with blue lines representing 0.1 second intervals. Out of 10 
impulses contained in the test data, 9 were identified. 

90% of pulses being correctly identifed prove the applicability of CUSUM procedure described above 
upon the raw, time-domain radio data. Detectability of a signal is controled by and relied upon the preset 
threshold, which is calculated based on the assumed probability distribution of data Q. 



4 Conclusion 



We have presented the algorithm, which provides reliable and computationally efficient detection of 
dispersed radio transients in time domain. The algorithm is well-suited for being implemented on FPGA 
or GPU platforms for real time detection on arrays such as SKA. Statistical methods used in the algorithm 
provide easy staging of detection process across multiple handling points giving the opportunity for significant 
reduction of data volume at each consequent stage of detection. 



Figure 1: Time/frequency representation of pulsar J0835 — 4510, observed at Parks telescope, central 
frequency 1416 MHz, bandwidth 64 MHz, recorded with 2-bit VLBI recorder. The recording started at 
2009.12.10 — 17 : 25 : hlUTC. Short bars mark 0.1s intervals. Vertical red lines mark the beginning of 
detected pulses. 
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